ISODATA clustering with parameter (threshold for merge and split) estimation based on GA: Genetic Algorithm
نویسنده
چکیده
A method of GA: Genetic Algorithm based ISODATA clustering is proposed. GA clustering is now widely available. One of the problems for GA clustering is a poor clustering performance due to the assumption that clusters are represented as convex functions. Well known ISODATA clustering has parameters of threshold for merge and split. The parameters have to be determined without any assumption (convex functions). In order to determine the parameters, GA is utilized. Through comparatives studies between with and without parameter estimation with GA utilizing well known UCI Repository data clustering performance evaluation, it is found that the proposed method is superior to the original ISODATA. It is found that the experimental results show that clustering error of the proposed method is 2 to 10 times much smaller than that of the existing method. It is also found that the elite selection strategy is superior to the average selection through experiments.
منابع مشابه
Comparative Study between the Proposed GA Based ISODAT Clustering and the Conventional Clustering Methods
A method of GA: Genetic Algorithm based ISODATA clustering is proposed.GA clustering is now widely available. One of the problems for GA clustering is a poor clustering performance due to the assumption that clusters are represented as convex functions. Well known ISODATA clustering has parameters of threshold for merge and split. The parameters have to be determined without any assumption (con...
متن کاملGenetic Algorithm Utilizing Image Clustering with Merge and Split Processes Which Allows Minimizing Fisher Distance Between Clusters
Genetic algorithm utilizing image clustering with merge and split processes which allows minimizing Fisher distance between clusters is proposed. Through experiments with simulation and real remote sensing satellite imagery data, it is found that the proposed clustering method is superior to the conventional k-means and ISODATA clustering methods in comparison to the geographic maps and classif...
متن کاملNBCT: A Toolkit for Naive Bayes Clustering
3 Clustering algorithms 5 3.1 ML/MAP based clustering . . . . . . . . . . . . . . 5 3.1.1 Parameter estimation based on ML . . . . . . 5 3.1.2 Parameter estimation based on MAP . . . . . 5 3.1.3 Membership distribution . . . . . . . . . . . 6 3.1.4 Dissimilarity . . . . . . . . . . . . . . . . . 7 3.1.5 Clustering . . . . . . . . . . . . . . . . . . . 7 3.1.6 Relevance analysis . . . . . . . . ...
متن کاملIdentification of outliers types in multivariate time series using genetic algorithm
Multivariate time series data, often, modeled using vector autoregressive moving average (VARMA) model. But presence of outliers can violates the stationary assumption and may lead to wrong modeling, biased estimation of parameters and inaccurate prediction. Thus, detection of these points and how to deal properly with them, especially in relation to modeling and parameter estimation of VARMA m...
متن کاملEstimation of groundwater level using a hybrid genetic algorithm-neural network
In this paper, we present an application of evolved neural networks using a real coded genetic algorithm for simulations of monthly groundwater levels in a coastal aquifer located in the Shabestar Plain, Iran. After initializing the model with groundwater elevations observed at a given time, the developed hybrid genetic algorithm-back propagation (GA-BP) should be able to reproduce groundwater ...
متن کامل